Efficient ICCG on a Shared Memory Multiprocessor
Authors
Abstract
In this paper we discuss different approaches for exploiting parallelism in the ICCG method for solving large sparse symmetric positive definite systems of equations on a shared memory parallel computer. Techniques for efficiently solving triangular systems and computing sparse matrix-vector products are explored. Three methods for scheduling the tasks in solving triangular systems are implemented on the Sequent Balance 21000. Sample problems that are representative of a large class of problems solved using iterative methods are used. We show that a static analysis to determine data dependences in the triangular solve can greatly improve its parallel efficiency. We also show that ignoring symmetry and storing the whole matrix can reduce solution time substantially.
*Submitted to: International Journal of High Speed Computing.
†Ph.D. student at Rensselaer Polytechnic Institute, Troy, NY 12180, and Visiting Research Associate at the Research Institute for Advanced Computer Science, NASA Ames Research Center, Moffett Field, CA 94035.
‡Research Institute for Advanced Computer Science, NASA Ames Research Center, Moffett Field, CA 94035.
§Work reported herein was supported by Cooperative Agreement NCC 2-187 between the National Aeronautics and Space Administration (NASA) and the Universities Space Research Association (USRA).
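The static dependence analysis mentioned in the abstract can be illustrated with level scheduling, one common way to expose parallelism in a sparse triangular solve. The abstract does not say which three scheduling methods the authors implemented, so the sketch below is an assumption and not a reproduction of their code. The CSR arrays (`indptr`, `indices`, `data`) and the function names `build_levels` and `solve_by_levels` are hypothetical, chosen only for this example.

```python
def build_levels(indptr, indices):
    """Static analysis: assign each row of a sparse lower-triangular
    matrix (CSR layout, assumed for this sketch) to a dependency level.
    Row i depends on row j whenever L[i, j] != 0 with j < i."""
    n = len(indptr) - 1
    level = [0] * n
    for i in range(n):
        for k in range(indptr[i], indptr[i + 1]):
            j = indices[k]
            if j < i:
                # Row i can only start after every row it reads from.
                level[i] = max(level[i], level[j] + 1)
    groups = [[] for _ in range(max(level, default=-1) + 1)]
    for i, lv in enumerate(level):
        groups[lv].append(i)
    return groups


def solve_by_levels(indptr, indices, data, b, levels):
    """Forward substitution L x = b, visiting rows level by level.
    Rows inside one level are mutually independent, so on a shared
    memory machine they could be handed to different processors;
    here they run sequentially to keep the sketch simple."""
    x = [0.0] * len(b)
    for rows in levels:
        for i in rows:
            s = b[i]
            diag = None
            for k in range(indptr[i], indptr[i + 1]):
                j = indices[k]
                if j == i:
                    diag = data[k]
                elif j < i:
                    s -= data[k] * x[j]
            x[i] = s / diag
    return x


# Example matrix (made up for illustration):
# [2 0 0 0]
# [0 3 0 0]
# [1 1 4 0]
# [0 0 1 5]
indptr = [0, 1, 2, 5, 7]
indices = [0, 1, 0, 1, 2, 2, 3]
data = [2.0, 3.0, 1.0, 1.0, 4.0, 1.0, 5.0]
b = [2.0, 3.0, 6.0, 6.0]

levels = build_levels(indptr, indices)
x = solve_by_levels(indptr, indices, data, b, levels)
```

In this example rows 0 and 1 have no dependences and form the first level, row 2 waits for both, and row 3 waits for row 2. The level sets depend only on the sparsity pattern, so they can be computed once and reused in every ICCG iteration.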
Related resources
Parallel ICCG on a Hierarchical Memory Multiprocessor - Addressing the Triangular Solve Bottleneck
The incomplete Cholesky conjugate gradient (ICCG) algorithm is a commonly used iterative method for solving large sparse systems of equations. In this paper, we study the parallel solution of sparse triangular systems of equations, the most difficult aspect of implementing the ICCG method on a multiprocessor. We focus on shared-memory multiprocessor architectures with deep memory hierarchies. O...
Shared Memory Versus Message Passing for Iterative Solution of Sparse Irregular Problems
The benefits of hardware support for shared memory versus those for message passing are difficult to evaluate without an in-depth study of real applications on a common platform. We evaluate the communication mechanisms of the MIT Alewife machine, a multiprocessor which provides integrated cache-coherent shared memory, message passing, and DMA. We perform this evaluation with “best-effort” impl...
A Scaleable Multiprocessor Architecture with Multiple Read-Write Memory Model
This paper presents a scalable multiprocessor architecture with multiple access memories and multi-way busses. This parallel architecture, with a more intelligent memory model and an efficient multi-way interconnection network organization, is called the CRrCW (Concurrent Read and restricted Concurrent Write) scalable multiprocessor system. The memory and network model provides concurrent memory acces...
Algorithms for Dynamic Software Cache Coherence
[19] L. Lamport. How to make a multiprocessor computer that correctly executes multiprocess programs. The directory-based cache coherence protocol for the DASH multiprocessor. In 17th Int'l. [24] H. Sandhu. Shared Regions-A strategy for efficient cache management in shared-memory multi-processors.
Journal title:
- International Journal of High Speed Computing
Volume 4, Issue
Pages -
Publication date 1992